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© Paging supervisor. 



© In a computer system having a virtual storage memory with pages therein and including a real storage area 
and a backup storage area employs the method of allocating the storage of information between the real storage 
area and the backup storage area including the steps of generating a classification of information stored in the 
real storage area as volatile storage and selectively removing the volatile storage information from the real 
storage area without storing it elsewhere in the system. 
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PAGING IN CACHED DATA PROCESSING APPARATUS 



The present invention relates to paging in cached data processing apparatus and provides a new data 
attribute, volatile storage, for computer systems with virtual storage. Volatile storage contains data that are 
not swapped out to secondary storage. For many applications this is a useful feature which can result in 
reduced system overhead. 

5 Virtual storage extends the power of computer storage by expanding the number of storage addresses 

that can be represented in a system while relaxing the limitation that all addressable storage must be 
present in the computer's main store. The address translation hardware employed for such virtual storage 
requires page tables fixed in real memory to perform its function. The size of a conventional page table is 
proportional to the size of the virtual address space, placing a practical limit on the address space size. 

10 Paged segmentation is a means of reducing this overhead. It takes advantage of the grouping of related 

data in virtual storage by representing page table data separately for each segment This allows space to 
be saved for short or unused segments of the address space. 

An inverted page -table further expands the range of addressability by reducing the real storage 
overhead required to support a very large virtual address space. Since an inverted page table contains an 

is entry for each page of real memory, its overhead is proportional to physical rather than virtual memory size. 
This is possible because the address translation hardware needs only the location of pages that are present 
in real memory. If a page is not present, the hardware must detect this fact, but it does not require the 
backing store address. 

The operating system's paging supervisor program does need this information, but appropriate coding 
20 techniques can allow this program to take page faults when accessing the external page tables which 
describe the backing store mapping for each virtual page. Thus, the external page tables need not be fixed 
or pinned in real storage. The frequency with which these tables are referenced will cause them to occupy 
some fraction of real memory.* however. Furthermore, the presence of external page tables in the virtual 
address space requires additional external page table entries for each page containing external page table 
25 data. Ultimately, at least one external page table must be pinned to avoid infinite recursion in the paging 
supervisor. 

The richness of the addressing resource provided using the inverted page table approach simplifies 
storage management in most software. Both application and system routines can be coded using safe, 
straightforward data structures. Large arrays, which can be allocated statically at compile time, are 

30 particularly useful for representing the objects being processed. Data represented in this manner can be 
accessed very efficiently, while compiler range checking easily ensures that no other data are accidentally 
modified. The requirement that the data structure be bounded is frequently not a problem, since reasonable 
limits may be imposed by the size of the application that can feasibly be attempted, and because the cost 
of providing a large virtual extent is quite modest. As long as the program is coded not to touch storage 

35 containing unused array entries, no real storage will be allocated. 

Many machines with virtual memory support provide hardware bits that record references and changes 
to each page. The change bit is set on any store into the page, while the reference bit is set on any access 
(load or store). A page that has not been written into since it was read from secondary storage will have a 
zero change bit A page in this state is said to be "clean". When the in-storage copy of the page is 

40 modified, the change bit is set and the page becomes "dirty". Dirty pages must be written back to 
seconds^ storage before they can be replaced for other uses, whereas clean pages are immediately 
available 

Having a very large range of addresses makes it feasible to map a system's entire database using a 
single set of virtual addresses. This data management technique is sometimes known as a one-ievel store. 

45 In a one-level store each segment must be targe enough to represent an entire file or collection of data. 
Segment numbers may be permanently assigned to each file, or they may be assigned when the file is 
opened. The primary advantage of this reusable virtual address approach is that a much smaller segment 
table is needed, eliminating the complexities introduced when the segment table must be paged. To resolve 
segment names when a file is opened, a conventional file system directory must be maintained. This 

so provides a convenient place to keep user authorisation and access rights. 

If a one-level store is provided by the system, then it is possible to support the persistent storage class. 
This is a powerful high-level language data abstraction that provides direct access to file-system segments. 
Any data structure can be declared to have the attribute " persistent w . Once a persistent data item has been 
opened, it can be addressed in the same manner as any other variable in the program. Variables in the 
persistent data structure retain the values they had the last time they were committed. New values can be 
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assigned and these changes committed or abandoned, as needed. Before a persistent segment can be 
addressed, it must be opened, binding the segment identified to a specific name in the file system. Various 
sharing, locking, and daxa integrity checks can be made when the variable, is opened. The persistent data 
attribute is a useful generalisation of conventional file access methods, 
s According to the present invention, there is provided data processing apparatus having a cache, 
comprising a finite set of page frames, and backing storage between whicn pages of data are paged under 
the control of a paging supervisor which, inter alia, maintains an inverted page table defining which pages 
reside in which page frames, a free list of which page frames are available to receive non-resicent 
requested pages and a record of the activity that has taken place with respect to each resident oage and 
to which preforms a cyclic housekeeping function on the cache to determine, in a first cycle, which resicent 
pages are inactive in that they have not been referenced since last inspected and. of these, which inactive 
pages have been rendered dirty by being written into, and, in a second cycle, to update the corresponding 
copies of dirty inactive pages in backing storage and thereafter to transfer the corresponding page frames 
to the free list, page frames determined to contain clean inactive pages being available for immediate 
75 transfer to the free list, wherein means are provided for designating that pages in identified page frames are 
volatile, the paging supervisor being responsive to such designations to remove all control data relative to 
such pages from the inverted page table without adding the corresponding page frames to the free list and 
without updating the corresponding copies thereof in backing storage. 

One or more pages of virtual storage may be said to be "volatile" if they exist only whiie in primary 
20 memory. When a volatile page is selected for replacement, it is released rather than being swapped out to 
secondary storage. This is the feature that gives the volatile store of the present invention its name: if a 
page remains unused for too long, it will " evaporate 

For most applications, there must be some assurance that pages currently being accessed do not 
disappear while updates are being made. Although the system is unlikely to select currently active pages 
25 for replacement, the program must work correctly even in situations where higher-priority tasks may pre- 
empt its real storage resources. 

Accordingly, the paging supervisor can be arranged to progress volatile pages from active to inactive in 
a first housekeeping cycle and to remove alt references to inactive volatile pages in a second housekeeping 
cycle and means provided for restoring referenced volatile active or inactive pages to the dirty active state 
30 and for restoring inactive volatile pages to the active volatile state if again specifically designated volatile. 

Most systems use page replacement schemes based on hardware reference bits, which do not 
guarantee that the most recently referenced volatile page will remain in real storage. These reference bits 
provide a good solution to the problem, however. When the application references a page, it becomes 
implicitly non-volatile until explicitly made volatile by a "vrelease". If a page that has been referenced but 
35 not released is selected for pageout. it must be copied to secondary storage. As long as the application 
vreleases its pages, it will gain the performance advantage of volatile storage. Pages which are not 
vreleased will behave like ordinary virtual memory- The application can rely on pages being updated to 
remain intact. It explicitly vreleases pages that are not needed immediately, declaring that it is willing to 
regenerate the page, rf necessary. 
40 When a page or segment is declared to be volatile, the program making the request may declare an 
entry point to perform a pseudo pagein operation. When an unmapped volatile page is referenced, the 
paging supervisor allocates an empty frame and calls the pseudo pagein routine to initialise it to whatever 
empty state is desired by the program. This may be implemented by signalling the task or by sending a 
message to some designated process. 
45 Since the paging supervisor is usually required to zero newly allocated frames to preserve data 
security, some applications may not need a pseudo pagein handler because a zero field can be used to 
detect when a newly allocated page is being processed. 

The present invention will be described further by way of example with reference to an embodiment 
thereof as illustrated, together with an example of a prior art arrangement, in the accompanying drawings in 
so which:- 

FIGURE 1 is a diagram showing the usual transitions between states for an virtual storage pages in 
the prior art example; and 

FIGURE 2 is a diagram showing these states and transitions, together with the additional volatile page 
states and transitions of the embodiment of the present invention. 
55 The described paging arrangement is useful in a computer system such as described in the article 
"Design Of The IBM RT PC Virtual Memory Manager"' by J. C. O'Quin, J. T. O'Quin, Mark D. Rogers and 
T. A. Smith appearing in the publication "IBM RT Personal Computer Technology". 1986. IBM Form No. SA 
23-1057. That article, including the references cited therein, is incorporated herein by reference. 
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Page Replacement -The " Clock " Algorithm 

When a page is in main storage, it resides in a page frame, a buffer capable of holding a single page. 
The page replacement algorithm determines which pages to replace when new page frames are needed. 
5 What is now to be described is an extension to the clock page replacement algorithm, which is also 
commonly known as second chance. The standard version of this algorithm uses the hardware reference 
bits to detect pages that have been inactive since the last time their page frames were scanned. For 
example: 



T5 



20 



25 



* while more frames required do; 

advance frame pointer; 

if end of storage then 

point to first frame; 
endif ; 

35 if frame in use and page not pinned then 

if reference bit set then 



30 



40 



45 



50 



reset reference bit; 



else 



if page modified then 
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initiate page out; 

5 

else 

Jo add frame to free list; 

endif; 

rs endif; 




endif ; 

20 

* end do; 

Page States 

25 

One way of describing a paging subsystem is to describe its page states. Each page must always be 
found in a well-defined state, so any operations that cannot be performed within a single paging subsystem 
critical section will tend to require additional page states. A typical paging subsystem using the clock 
algorithm and keeping a free list of frames available for immediate replacement, might consider the 
30 following states significant: 

1 . UR -unreferenced ■ ^ 

2. DA -dirty, active 

3. Dl -dirty, inactive 

4. PO -page out 

35 5. SS -in secondary storage 

6. PI —page in 

7. CA -clean, active 

8. CI —clean, inactive 

9. FR -free 

40 New pages, which have never been referenced, are in the UR state. When the first reference occurs, a 
page frame is allocated, and initialised to zeros by the paging subsystem. This destroys any other user's 
data and sets the change bit so the page is in the DA state. There is no valid copy in secondary storage. 

Whenever a DA page is noticed by the clock algorithm, its reference bit is reset, putting it in the Dl 
state. As long as the page is referenced again before the next clock cycle, it will return to the DA state and 

*s remain in storage. If a Dl page has not been referenced for a full cycle of the clock, it is selected for 
replacement Since the page is dirty, it must be written to secondary storage before being reallocated. 
During this I/O operation, the page is in the PO stage. When the I/O operation is completed, the page is 
added to the free list from which pages are selected for replacement. FR pages that are replaced no longer 
have a frame assigned. Since the only copy is on secondary storage, their state is SS. 

so When an SS page is referenced, a free frame is assigned, and a read from secondary storage is 
initiated. During this operation the page is in the PI state. When the read completes, the page goes to the 
CA state and is ready for use. CA pages age to the CI state when seen by the clock algorithm. As long as 
they are used again within the next clock cycle, they become CA again. If the reference was a store, the 
page becomes DA instead. If a CI page goes unreferenced for an entire clock cycle, it is placed on the free 

55 list immediately. Since the page is clean, there is already a valid secondary storage copy, so no page out is 
required. 
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There are other, more unusual, state transitions that were omitted from the diagram of Fig. 1 for clarity. 
For example, pages on the free list can be reclaimed if they are referenced again before being replaced. 
Since they are clean, they would go to the CA state. A similar reclaim mechanism aliows PO pages to be 
reclaimed if they are referenced. 

Also, there is usually some mechanism for returning page to the UR state. This release operation might 
be performed when storage is freed, when a program is unloaded, or when a segment's sizes changes. 

The "vrelease" Operation 

When an application program is through using a page, it may perform a "vrelease" operation to inform 
the paging subsystem that it should be released if it is replaced. A vrelease of a page with a frame 
assigned changes the page state to volatile, active (VA). If the page was in secondary storage, it 
immediately becomes UFTand the secondary storage block is released. 

When a VA page is encounted by the clock algorithm, it changes the page state to volatile, inactive (VI). 
If the page remains VI for an entire cycle, its frame will be reallocated, and the page state becomes UR. 

The vrelease operation declares that a virtual storage area's current contents are no longer required. All 
pages within this area become volatile and they are subject to being discarded without notice. If any of 
these pages are referenced again, they immediately become non-volatile again. At that moment, they will 
either contain their original contents or all zeros. Application' programs can use this to determine whether 
the data still remain. 

The vrelease operation can be implemented as follows: 

* for each page contained in the range from vaddr to (vaddr-f-length-1) 
do; 

if I/O in progress for this page 

try to cancel the request; 

wait for cancellation or completion; 
endif ; 



if page frame assigned 

set volatile and active flags in page frame table; 
set hardware reference and change bits to "changed, 
not referenced" * 

else 



set page state to UR; 
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endif; 

if secondary storage block assigned 

free secondary storage block; 
endif; 



* 



end; 



is This technique leaves the page in a state that will be changed if any storage operation sets the 
hardware reference bit. When this bit is set, the volatile and active flags in the page frame table will be 
ignored- This means that the page state is changed to DA on any reference without the overhead of a page 
fault interrupt. 

20 

Clock Algorithm Support for Volatile Pages 

To use this technique, the clock algorithm must be modified slightly to handle the new volatile page 
states: 

25 



30 

* while more frames required do; 

35 

advance frame pointer; 
40 if end of storage 

point to first frame; 

45 

endif; 

if frame in use and page not pinned 

50 



55 
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iference bit set 
reset volatile flag; 
reset reference bit; 

i£ volatile flag set 

if active flat set 

reset active flag; 

else 

set UR page state 
add frame to free list? 
endif ; 

else 

if page modified 

initiate page out; 

else 

add frame to free list; 
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endif; 

5 

endif; 




endif; 

TO 



15 



endif; 
* end do; 



EXAMPLES 



20 Dynamic Storage Management 

The classic "getmain" and "freeman" operations for dynamic storage allocation can make good use of 
volatile storage. These system routines dynamically allocate and free variable-sized areas-of storage from a 
heap in the calling process's virtual address space. Similar functions are provided in most systems; other 

25 common names are new and dispose, or malloc and free. 

When storage is returned to the heap via freemain, and an entire page is no longer allocated, some 
systems will release the page, putting it in the UR state. This avoids paging out useless information. 
However, if the page is used again soon, this strategy increases paging subsystem overhead. When the UR 
page is reused, a page fault interrupt occurs, and a new frame must be allocated. Since -this frame may 

30 have belonged to another process, its contents must be destroyed. Usually, this is done by clearing it with 
binary zeros. 

This overhead can be avoided by performing a vrelease operation on the freed area. If the page is not 
reused anytime soon, it will be selected for replacement, and its contents will be discarded. Should a 
subsequent getmain reuse the page before this occurs, the reference bit will be set and the page will return 
35 to the DA state without causing a page fault interrupt. Since the page already belonged to this process, 
there is no need to destroy its contents. Hence, no time need be spent setting its contents to zero. 



Buffer Management 

40 

A problem that arises with many buffer management techniques is that optimisations within the buffer 
management code may conflict with those made by the virtual storage subsystem. For purposes of the 
present description, assume that the buffers are allocated in virtual storage and are only pinned while I/O 
activity is pending. Since pinned pages cannot be selected for replacement they are a limited resource and 
45 must not be abused. 

If buffers are allocated from a pool in "least recently used" order, there is danger that a free buffer that 
has not been used recently may be paged out by the clock algorithm. If this occurs, then when the "oldest" 
buffer is allocated, its previous contents will be paged in before being thrown away. Because the clock 
algorithm selects frames for replacement when they have not been referenced recently, it is likely that the 

50 next buffer to be allocated will also have been paged out. In such cases, it is quite possible that all 
subsequent allocations will result in two unnecessary I/O operations: a write to preserve unneeded data 
followed by a read to get it back. 

If the contents of freed buffers are never reused, then a vrelease should be done on each buffer page 
as soon as it is freed. This prevents the page from being written to secondary storage while avoiding 

55 unnecessary page fault interrupts if the buffer is reused whiie there is still a frame assigned. Used in 
conjunction with a "most recently used" buffer allocation policy, this scheme minimises paging subsystem 
overhead for the buffer pool. 
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When the buffer contents are likely to be reused, they can be managed like a cache. When looking for 
a cache hit, the buffer page must be inspected to see if some field is non-zero. As long as the buffer does 
not cross a page boundary, the present volatile storage concept ensures that either the data win all be 
present or the whole page will be zero. A control field in the same page with the buffer can be used to 
s detect presence, or some non-volatile control block could save the offset of the first non-zero byte (if any.i 
within the data. 

If the data come from a device that is slower than paging secondary storage, then it may be appropriate 
to permit the buffer cache to be paged out. In this case the vrelease is done not when the buffer is freed 
but when it is reallocated. This eliminates unnecessary page reads in case the previous contents had been 
10 paged out, and avoids unnecessary page fault interrupts when there is still a frame assigned. 

Simulation by way of Stepwise Compilation 

75 Stepwise compilation provides an efficient method for simulating the instruction set of one machine on 
another. This method differs from the conventional simulation technique of decoding an instruction and then 
branching to a subroutine that performs the appropriate operations on the simulated address space and 
CPU state. With stepwise compilation, the routine to perform each decoded instruction is tailored to the 
specific registers and other operands of the simulated instruction and placed into a cell of a "software 

20 cache" from which it can be executed directly. Once a cache cell has been built, subsequent re-execution 
of that instruction need not again incur the decode overhead. Since programs spend much of their time 
looping, the likelihood of reusing a cached instruction is quite high. 

The method described here is the simplest form of simulation using stepwise compilation. There are 
more complex techniques that involve optimising groups of cached instructions. Some of these methods 

25 can also make effective use of a volatile cache, but the additional complexity would confuse the discussions 
of volatile storage management 

By representing the cache in a volatile segment, the simulator avoids paging system overhead. With 
most stepwise compilation techniques, it is not worthwhile to swap a cache page to the backing store, since 
it can be regenerated far more quickly than it can be transferred. Also, the simulator is relieved of the 

30 burden of managing its instruction cache. It creates a cache cell whenever it needs one by storing the 
desired contents into a volatile segment page. If the page is not mapped, a page fault will occur and a new 
page frame will be assigned. 

The simulator depends on referenced pages not disappearing until a "vrelease" is done. Cache entries 
must not be thrown away while they are being created. Similarly, the cache entry that is currently executing 

3S must not suddenly disappear. 

Branching to a page, or storing instructions there, will set the hardware reference bit, ensuring that the page 
will remain until it is vreleased. Whenever the simulated program branches from one cache cell to another, 
the page being left is vreleased. If rt is re-entered in the future, the simulator can always retranslate the 
cache cell if necessary. It can tell that retranslation is needed when the cell's contents have reverted to 

40 zero's. 

The paging supervisor is able to manage cache replacement for the simulator in a very effective 
manner. Since the pager is taking system-wide load and usage factors into account, it is able to select 
pages for replacement when they begin to interfere with other storage resource requirements. A global page 
allocation algorithm will optimise system-wide use of real storage, including simulator cache usage as a 

45 factor, but may not be "fair" to specific users or tasks. A local page replacement policy can be used if it is 
important that the large working set of the simulator not impact the real storage available to other system 
tasks. The paging supervisor is able to make intelligent choices between these conflicting demands 
because it has a broader view of the requirements of the system as a whole than does the simulator. 

If many real storage frames are available, then the cache can be quite large, improving simulator 

so performance by increasing the cache hit ratio. The paging supervisor can limit the number of frames used 
for the cache based on system-wide working set requirements. This number can be adjusted periodically as 
the working set changes. The page replacement strategy may choose volatile pages more readily than 
others, since the cost of replacing them is smaller. 

55 
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SUMMARY 

Volatile storage has been described and its use illustrated by several examples. The dynamic storage 
management and buffer management examples showed how volatile storage can reduce oaging system 
5 overhead. The simulator application takes advantage of the efficiency and data organisation benefits 
provided by the volatile storage attribute. The trend in microprocessors is towards high CPU speed, iarge 
main store, and relatively slow inpuvoutput (I/O) access time. This represents a change in the ratios of the 
relative costs of certain key system components. Significant changes in these cost ratios should be 
reflected in the system design. Volatile storage helps the system take advantage of processor sDeed and 
io storage capacity while minimising impact on an already marginal I/O capacity. 

There has been described, in a computer system employ.ng a virtual storage memory having pages 
therein and including a real storage area and a backup storage area, the method of allocating the storage of 
information between said real storage area and said backup storage area comprising the steps of 
generating a classification of information stored in said real storage area as volatile storage: and selectively 
15 removing said volatile storage information from said real storage area without storing it elsewhere in said 
system. The method can include the step of identifying portions of said volatile storage whicn are being 
accessed by said computer system to prevent removal of said identified volatile storage portions from said 
real storage area without being restored elsewhere in said system and/or the step of adding reference bits 
to selected ones of said pages in said real storage area to prevent removal of said selected pages from 
said real storage without restoring sard selected pages elsewhere in said system and/or the step of 
removing said reference bits from said selected pages to permit removal of said selected pages from said 
real storage area without being restored elsewhere in said system. 

25 Claims 

1. Data processing apparatus having a cache, comprising a finite set of page frames, and backing 
storage between which pages of data are paged under the control of a paging supervisor which, inter alia, 
maintains an inverted page table defining which pages reside in which page frames, a free list of which 

30 page frames are available to receive non-resident requested pages and a record of the activity that has 
taken place with respect to each resident page and which preforms a cyclic housekeeping* function on the 
cache to determine, in a first cycle, which resident pages are inactive in that they have not been referenced 
since last inspected and, of these, which inactive pages have been rendered dirty by being written into, 
and, in a second cycle, to update the corresponding copies of dirty inactive pages in backing storage and 

35 thereafter to transfer the corresponding page frames to the free list, page frames determined to contain 
clean inactive pages being available for immediate transfer to the free list, wherein means are provided for 
designating that pages in identified page frames are volatile, the paging supervisor being responsive to 
such designations to remove all control data relative to such pages from the inverted page table without 
adding the corresponding page frames to the free list and without updating the corresponding copies 

40 thereof in backing storage. 

2. Apparatus as claimed in claim 1 wherein the paging supervisor is arranged to progress volatile pages 
from active to inactive in a first housekeeping cycle and to remove all references to inactive volatile pages 
in a second housekeeping cycle. 

3. Apparatus as claimed in claim 2 wherein means are provided for restoring referenced volatile active 
45 or inactive pages to the dirty active state. 

4. Apparatus as claimed in claim 3 wherein means are provided for restoring inactive volatile pages to 
the active volatile state if again specifically designated volatile. 

50 
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